Fix TensorRT-LLM #7142

Merged
mc-nv merged 7 commits into r24.04 from krish-24.04
Apr 23, 2024

Conversation

mc-nv
Contributor

@mc-nv mc-nv commented Apr 20, 2024

No description provided.

@mc-nv mc-nv changed the title from "Krish 24.04" to "Fix TensorRT-LLM" on Apr 20, 2024
Tabrizian
Tabrizian previously approved these changes Apr 20, 2024
Member

@Tabrizian Tabrizian left a comment

@krishung5 might be more familiar with these changes.

@@ -1708,6 +1708,30 @@ def tensorrtllm_prebuild(cmake_script):
    # Export the TRT_ROOT environment variable
    cmake_script.cmd("export TRT_ROOT=/usr/local/tensorrt")
    cmake_script.cmd("export ARCH=$(uname -m)")
    cmake_script.cmd(
        'export LD_LIBRARY_PATH="/usr/local/cuda/compat/lib.real:${LD_LIBRARY_PATH}"'
Member

Why is this line required?

Contributor Author

Because libcuda.so.1 can't be resolved unless --runtime=nvidia is passed.
It's a workaround that lets us refer to the links in the build container; it can be deprecated in the future.

You can create a task and assign it to me.
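
For readers following the thread, here is a minimal sketch of why prepending a directory to LD_LIBRARY_PATH makes a library findable. It is not code from build.py: `prepend_library_path`, `find_library`, and `demo` are hypothetical helpers, and the lookup is a simplified model that only walks the colon-separated path in order. The real dynamic linker also consults RPATH/RUNPATH, the ld.so cache, and default system directories. Only the compat directory path comes from the diff above.

```python
import os
import tempfile

# Directory taken from the diff above; everything else is hypothetical.
COMPAT_DIR = "/usr/local/cuda/compat/lib.real"


def prepend_library_path(new_dir, current):
    """Return an LD_LIBRARY_PATH value in which new_dir is searched first."""
    return f"{new_dir}:{current}" if current else new_dir


def find_library(name, search_path):
    """Return the first match for name along search_path, or None.

    Simplified model: directories are checked in order and empty
    entries are ignored.
    """
    for directory in search_path.split(":"):
        if not directory:
            continue
        candidate = os.path.join(directory, name)
        if os.path.exists(candidate):
            return candidate
    return None


def demo():
    """Show the lookup failing before the prepend and succeeding after."""
    # A temporary directory stands in for the compat directory so the
    # sketch runs on a machine without CUDA installed.
    with tempfile.TemporaryDirectory() as fake_compat:
        open(os.path.join(fake_compat, "libcuda.so.1"), "w").close()
        before = "/nonexistent/lib"
        after = prepend_library_path(fake_compat, before)
        found_before = find_library("libcuda.so.1", before)
        found_after = find_library("libcuda.so.1", after) is not None
        return found_before, found_after
```

Calling `demo()` returns a pair: the lookup result against the original path, and whether the lookup succeeds once the stand-in directory is prepended.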

@mc-nv mc-nv merged commit bdab292 into r24.04 Apr 23, 2024
3 checks passed
@mc-nv mc-nv deleted the krish-24.04 branch April 23, 2024 02:20
mc-nv added a commit that referenced this pull request Apr 25, 2024
* TRT-LLM build

* Update versions

* Remove statement, as unused

* Remove cache

* add cmake option to set CXX11 ABI

* Mchornyi krish 24.04 (#7149)

* Enable TensorRT-LLM build outside of CMake

* TensorRT-LLM requires lower version of cuDNN

* Format

---------

Co-authored-by: krishung5 <[email protected]>
mc-nv added a commit that referenced this pull request Apr 30, 2024
* Fix TensorRT-LLM (#7142)

* TRT-LLM build

* Update versions

* Remove statement, as unused

* Remove cache

* add cmake option to set CXX11 ABI

* Mchornyi krish 24.04 (#7149)

* Enable TensorRT-LLM build outside of CMake

* TensorRT-LLM requires lower version of cuDNN

* Format

---------

Co-authored-by: krishung5 <[email protected]>

* Update README and versions for 2.45.0 / 24.04 (#7096)

* Update README and versions for 2.45.0 / 24.04

* Update ONNX Runtime version - 1.17.3

---------

Co-authored-by: krishung5 <[email protected]>
3 participants